IIT Kharagpur at TREC 2008 Blog Track
Abstract
This paper describes our opinion retrieval system for the TREC 2008 Blog track. We focused on five aspects of the system, each handled by a separate module. The first module extracts the blog content from junk HTML, thereby reducing noise in the indexed content. The second module removes various kinds of spam content from real blogs. The third module retrieves the relevant documents. The fourth module filters for opinionated documents, and the fifth calculates the polarity of the sentiments in each document. The final ranked retrieval runs were based on various combinations of settings in each module, so as to study the effect of each. For classification of subjectivity and polarity, the predictions were made using a complement naive Bayes classifier.
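The abstract names the classifier but gives no details of features, smoothing, or training data. As an illustration only, the following is a minimal sketch of the standard complement naive Bayes idea (without weight normalization), assuming whitespace-tokenized bag-of-words features and add-one smoothing. The function names `tokenize`, `train_complement_nb`, and `predict`, and the toy training sentences, are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokens stand in for the real features.
    return text.lower().split()


def train_complement_nb(docs, labels, alpha=1.0):
    """Return, per class, the log of smoothed term frequencies in all OTHER classes."""
    class_counts = defaultdict(Counter)
    for text, label in zip(docs, labels):
        class_counts[label].update(tokenize(text))

    # Term counts over the whole training set, and the vocabulary.
    total = Counter()
    for counts in class_counts.values():
        total.update(counts)
    vocab = set(total)
    total_tokens = sum(total.values())

    weights = {}
    for label, counts in class_counts.items():
        # Complement statistics: everything that is not in this class.
        comp_tokens = total_tokens - sum(counts.values())
        denom = comp_tokens + alpha * len(vocab)
        weights[label] = {
            term: math.log((total[term] - counts[term] + alpha) / denom)
            for term in vocab
        }
    return weights


def predict(weights, text):
    """Pick the class whose complement explains the document worst."""
    tokens = tokenize(text)

    def complement_score(label):
        w = weights[label]
        # Terms unseen in training are skipped.
        return sum(w[t] for t in tokens if t in w)

    return min(weights, key=complement_score)


# Toy polarity data, invented for illustration.
train_docs = [
    "great phone love it",
    "love the great camera",
    "awful battery hate it",
    "hate the awful screen",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = train_complement_nb(train_docs, train_labels)
print(predict(model, "great camera love"))
print(predict(model, "awful screen"))
```

The same sketch would apply to the subjectivity step by swapping the labels for, say, "opinionated" and "factual". Estimating each class from its complement is usually motivated by robustness to unbalanced class sizes, though the abstract does not state why the authors chose it.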
Similar resources
Query Representation and Understanding, July 28, 2011, Beijing, China: Organizers
Understanding the user needs underlying a query can be very difficult, even for a human relevance judge. When evaluating our algorithms, particularly those with a sophisticated query model, it may be wise to use real queries and a notion of relevance that is aligned with real user needs. I will present two lines of work in this area. One is the TREC Web Track, where we attempt to incorporate re...
University of Lugano at TREC 2008 Blog Track
We report on the University of Lugano’s participation in the Blog track of TREC 2008. In particular we describe our system for performing opinion retrieval and blog distillation.
On the TREC Blog Track
The rise of blogging as a new grassroots publishing medium and the many interesting peculiarities that characterise blogs compared to other genres of documents opened up several new interesting research areas in the information retrieval field. The Blog track was introduced in 2006 as part of the renowned Text REtrieval Conference (TREC) evaluation forum, to drive research on the blogosphere an...
THUIR at TREC 2008: Blog Track
This is the second time we have participated in the TREC Blog Track. The track has three main tasks: the relevant finding task, the opinion finding task and the polarity task. This year, we used multi-field relevance ranking in the relevant finding task; in the opinion finding task, we focused on combining the relevance score and the opinionate score using a unified generation model; in the polarity task, we devel...
DCU at the TREC 2008 Blog Track
In this paper we describe our system, experiments and results from our participation in the Blog Track at TREC 2008. Dublin City University participated in the adhoc retrieval, opinion finding and polarised opinion finding tasks. For opinion finding, we used a fusion of approaches based on lexicon features, surface features and syntactic features. Our experiments evaluated the relative usefulne...